A PROOF OF DEJEAN'S CONJECTURE 



JAMES CURRIE AND NARAD RAMPERSAD 



Abstract. We prove Dejean's conjecture. Specifically, we show that Dejean's conjecture 
holds for the last remaining open values of n, namely 15 < n < 26. 



1. Introduction 

Repetitions in words have been studied since the beginning of the previous century [16|[TT]. 
Recently, there has been much interest in repetitions with fractional exponent [H El El [71 El 
IT] . For rational 1 < r < 2, a fractional r-power is a non-empty word w = pe such that 
e is the prefix of p of length (r — l)\p\. We call e the excess of the repetition. We also say 
that r is the exponent of the repetition pe. For example, 010 is a 3/2-power, with excess 0. 
A basic problem is that of identifying the repetitive threshold for each alphabet size n > 1: 

What is the infimum of r such that an infinite sequence on n letters exists, 
not containing any factor of exponent greater than r? 

This infimum is called the repetitive threshold of an n-letter alphabet and is denoted 
by RT{n). Dejean's conjecture [6] is that 

n = 3 

BT(n) {l/~>. n = 4 




-1), n^3,4. 

Thue, Dejean and Pansiot, respectively [IIl[6l[T4j, established the values RT(2), RT(3), 
RT(A). Moulin Ollagnier [T3] verified Dejean's conjecture for 5 < n < 11, and Mohammad- 
Noori and Currie [T2] proved the conjecture for 12 < n < 14. Recently, Carpi |3J showed that 
Dejean's conjecture holds for n > 33. The present authors strengthened Carpi's construction 
to show that Dejean's conjecture holds for n > 27 [U [5]. In this note we show that in fact 
Dejean's conjecture holds for n > 2. We will freely assume the usual notions of combinatorics 
on words as set forth in, for example, [9]. 

2. MORPHISMS 

Given previous work, it remains only to show that Dejean's conjecture holds for 15 < n < 
26. This follows from the fact that the following morphisms are 'convenient' in the sense of 
[T3] . To make our exposition self-contained, we demonstrate in the remainder of this paper 
how these morphisms are used to prove Dejean's conjecture for 15 < n < 26. We introduce 
several simplifications and one correction to the work of Moulin Ollagnier |13j . 
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his 
his 

hie 
hie 

hir 
hir 

his 
his 

hig 
hig 

h20 
h20 

h2i 
hn 

h22 
h22 

h23 
h23 

h24 
h24 

h2S 
h2S 

h26 
h26 



01101101011011011011010101101010110110110110110101101101 
10101011011011010110101101101011010110110110101101010101 

101010110110110101101101101010101010101010110101010101010101 
011010110110110110101101101010110110101010110101010101010101 

1010101010101010101101101011011010101010101011010110110110101101 
1010101010101010101101101101101010101010110110110110110110110110 

10101010101101101101010110110101011011011010101011010110110101010101 
01101010101101101010110110110101011011011010101011010101010101010101 

101010101010101010101101101010110110101010101010101101011010110110101101 
101010101010101010101101101011011010101010101011011011011010110110110110 

1010101010101010101011011011010101010101011011011011011010110101011011010101 
1010101010101010101011011011010101101101011011011011010110110101011011010110 

101010101010101010101011011010101010101101101010101010110110110110101010101010101101 
101010101010101010101011011010101010110110101010101010110101101010101010101010110110 

101010101010101010101011010101010101010101010101101101101101101011011011011011010101 
101010101010101010101011010101010101011011010101101101101101011011011011011011010110 

1010101010101010101010101010101010101011011010110110110110101011010110110110110110101101 
1010101010101010101010101010101010101101101010110110110110110110110110110110110110110110 

10101010101010101010101011010101101101010101010101011010101010110110101101101101011011010101 
10101010101010101010101011010101101101010110110101011010101010110101101101101101011011010110 

101010101010101010101010101011011010101011011010110110110101101101011010101010101011011010110110 
101010101010101010101010101011011010101010110110110110110101101101101101101010101011011010101101 

1010101010101010101010101011010101010101101101010101010110110110101101011010110110110110110110110101 
1010101010101010101010101011010101010101101101101101010110110110101101010110110110110110110110110110 



We remark that the last letter of h n (0) is different from the last letter of h n (l) in each 
case. We also note that for each n, \h n (0)\ = 4n — 4, except for n — 21 where we have 
\hJ0)\=4n. 



3. Maximal repetitions 

For each h n of Section [21 word 011 is a factor of h n (0) and 110 is a factor of h n (l). It 
follows that |/i™(l)| becomes arbitrarily large as m increases, and that every factor of h%(0) 
is a factor of /i™(l) for some m. 

Let an occurrence of v in h^(0) be written h^(0) = xvy. Suppose that v has period q. 
We can write x = x'x", y = y'y" such that x"vy' has period q, and \x"vy'\ is maximal. This 
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is possible since every factor of ft^(O) is a factor of h™(l) for some m, and word 1 has two 
distinct left extensions 01 and 11, and two distinct right extensions 10 and 11. We refer to 
x"vy' as the maximal period q extension of the occurrence xvy of v. 

4. Pansiot encoding 

Fix n > 2. Let S n = {1,2, ... ,n}. Let v G X* have length m > n — 1, and write 
f = V\V2 ■ ■ -v m , Vi G E n . In the case where every factor of v of length n — 1 contains n — 1 
distinct letters, we define the Pansiot encoding of v to be the word b(v) = b\b 2 ■ ■ ■ & m -(n-i) 
where for 1 < i < m — n + 1 

0, Vi = u i+n _i 

1, otherwise. 



6; 



We can recover t> from 6(f) and viv 2 ■ ■ ■ f n -i- We see that v has period q if and only if 

\v\ — 71 -\- 1 

b(v) does. The exponent \v\/q of t> corresponds to an exponent of b(v). 

Q 

Let S n denote the symmetric group on S n with identity id and left multiplication, i.e., 

(fg)(i) = f(g(i)) for f,g £ S n ,ie s n . 

Let a : {0, 1}* — > S n be the semigroup homomorphism generated by 

1 2 ••• (n-2) (n-1) n 



a(0) = 

(7(1) = 

One proves by induction that 
(1) a(b(v)) = 



2 3 •■• (n-1) 1 n 

1 2 • ■ • (n-2) (n-1) n 

2 3 •■• (n-1) n 1 



1 2 ■■■ (n-2) (n-1) n 

Vm—n+2 ^m— n+3 ' ' ' ^m— 1 V m V 

where v is the unique element of S \ {v m , u m -i, . . . , f m -n+2}- 

Suppose that P.E G S* is a repetition of period q = \P\ > with > n — 1. It follows 
from ([1]) that a(b(P)) = id; i.e. that P is in the kernel of a. We refer to b(PE) as a kernel 
repetition of period q. Conversely, if u G S* and 6(u) is a kernel repetition of period q, 
then we may write u = PE = EP' for some words P, P', E where |P| = |P'| = g. 

Suppose that for a morphism ft, : {0, 1}* — > {0, 1}* there is a r G such that 

r ■ a(/i(0)) ■ r- 1 = a(0) 
t ■ a(h(l)) ■ r- 1 = (7(1) 

In this case we say that ft satisfies the 'algebraic condition'. 

5. Kernel repetitions with markable excess 

Let a uniform morphism ft : {0,1}* — ► {0,1}* be given. Let |ft(0)| = r > 0. A word 
v G {0, 1}* is markable (with respect to ft) if whenever h(X)xv and h(Y)yv are prefixes of 
ft w (0) with \x\, \y\ < r, then x = y. If a word is markable, its extensions are markable. Let 
U be the set of length 2 factors of ft w (0). A word v G {0, 1}* is 2-markable (with respect 
to ft) if whenever 

(1) u, u' G U, 

(2) h(X)xv is a prefix of h(u) with |x| < r, and 
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(3) h(Y)yv is a prefix of h(u') with \y\ < r, 
then x = y. 

If \v\ = r and v is a factor of h w (0), then f is a factor of h(u), some u EU . It follows that 
if f is 2-markable, then v is markable. For each n, if /i = fo n , we find {/ = {01, 10, 11}. It 
follows that all length r factors v are factors of /i(0110). A finite check shows that if = r 
and v is a factor of /i w (0), then t> is 2-markable, hence markable. 

Let n be fixed, 15 < n < 26 and let h = h n . One checks that /i satisfies the algebraic 
condition. Suppose that v = pe is a kernel repetition with period g = where h w (0) = xvy. 
Notice that every length q factor of pe is conjugate to p, by the periodicity of pe. It follows 
that every length q factor of pe lies in the kernel of a. Suppose that the excess e of v is 
markable. Let V = x"vy' be the maximal period q extension of the occurrence xvy of v. 
Write x = Xx', y = y'Y, so that h u (0) = XVY. Write V — PE — EP' where \P\ = q. 
Since E is an extension of e, E is markable. Write X = h(x)x' where \x'\ < r and write 
XP = ^(7)7' where |7'| < r. It follows from the markability of E that \' = 7'. Then the 
maximality of V yields \x'\ = |7'| = 0. We may thus write X = h(x), E = h(i])i]', with 
I rj' I < r. By the maximality of V, word 77' must be the longest common prefix of h(0) and 
h(l). Since E is a prefix and suffix of PE and E is markable, we know that r divides |P|. 
In total then, we may write XPE = h^x^v) 1 ]' where h(ir) = P, and 77 is a prefix of ir. Also, 
since h satisfies the algebraic condition, <j(ti) = id. Thus irrj is a kernel repetition in /i w (0). 
We see that \PE\ = r\7ir]\ + |??'|. 

The maximality of V implies that irr] is maximal with respect to having period |7r|. This 
means that if r\ is markable, we can repeat the foregoing construction. Eventually we obtain 
a kernel repetition VS with non-markable excess £. If it takes s steps to arrive at VE then 
we find that \PE\ = r s \V£\ + \i\ r * and l p l = rS \ v V 

6. Main result 

Let n be fixed, 15 < n < 26 and let h = h n . Suppose that u\ is a factor of /i w (0) with 
I Mi I = £. Extending u\ by a suffix of length at most r — 1, and a prefix of length at most 
r — 1, we obtain a word h(u 2 ), some factor u 2 of /^(O), where |u 2 | < [(£ + 2(r — \))/r\. 
Repeating the argument, we find that u\ is a factor of h 2 (us), some factor w 3 of /^(O) where 



L(£ + 2(r- l))/rj +2(r-l) 
r 

Define 

/(€jr)= L(^ + 2(r-l))/rJ+2(r-l) _ 

Let w be the cu-word over S n with prefix 123 • • • (n— 1) and Pansiot encoding 6(w) = /^(O). 
We will show that w contains no (^-j-) + -powers. Suppose to the contrary that pe is a 
repetition in w with \pe\/\p\ > n/ (n — 1) and e a prefix of p. 

First suppose that |e| > (n — 1). Let PE = b(pe). Then PE is a kernel repetition. Let 
7/ be the longest common prefix of h(0) and h(l). As in the previous section, replacing pe 
and PE by longer repetitions of period \P\ if necessary, we may assume that /^(O) contains 
a kernel repetition P£ with non-markable excess S such that \PE\ = r s \V£\ + \r)'\ Ei=o r * 
and |P| = r s \V\. 
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(2) 



l M 3| < 



We find that 

1 n 

1 + 



n — 1 n — 1 



< 



|P£| 


+ n - 1 




|P| 









=0 r l + n — 1 




v\ 







P| + r s |£j 


- + - 


v'\ 




-.\ r 1 














v\ 





+ 



r s \V\ 



so that 



< l + VB\\ S +W T + n-l 

VP \ r — 1 



|P| < (n-1) ^| + |V|^Y+n-lj 



and 



|P£| < + ( + ^— + n- 

\ r — 1 

(r 
r + (r — 1) hn- 
r — 1 

< 4n + (n-l)(9n- 1) 
= 9n 2 — 6n + 1. 

We use that |£| < r (since all factors of /i w (0) of length r or greater are markable) and 
r < An (as observed in Section [2]). Finally, since n' is a proper prefix of h(0), \n'\ < r. 

One verifies that I(9n 2 — 6n + 1, r) — 2. Since every length 2 factor of /^(O) is a factor 
of 0110, word b(PE) must be a factor of /i 2 (0110). Let t> be the word of S n with prefix 
123 • • • (n — 1) and Pansiot encoding /i 2 (0110). Since b(PE) is a kernel repetition, word v 
contains a repetition pe with |e| > n — 1. However, a computer search shows that v contains 
no such repetition. 

We conclude that |e| < n — 2. In this case, 

n \P e \ ii ii 

< — - \e\n > \pe\ 



n—1 \p\ 

=>- (n - 2)n - (n — 1) > \b(pe)\ 
n 2 — 3n + 1 > |6(pe)| 

However, n 2 — 3n + 1 < 9n 2 — 6n+ 1, so that again b(pe) must be a factor of /i 2 (0110), and u, 
defined as in the previous case, must contain a (-^J -power. However, a computer search 

shows that word t> is (^-) + -power free. 
We have proved the following: 
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Main Result: Let w be the word over S n with prefix 123 • • • (n — 1) and Pansiot encoding 
b(w) = h w (Q). Word w contains no (^-j-) + -powers. 

7. Final Remarks 

Our result builds on that of [13j, but uses somewhat simpler arguments, taking advantage 
of properties of our specific morphisms. In addition, we have specified bounds for the various 
computer checks, rather than invoking mere decidability. 

A large simplification results from the fact that our morphisms give binary words with no 
kernel repetitions at all (even of small exponent). When moving from PE to irn in Section [5] 
one can give the relationship between the exponents of these two kernel repetitions. 

\PE\ \tX7]\ \n'\ 

\P\ I -7T | r|7r| 

If it takes s steps to arrive from repetition PE to a repetition tttj with non-markable excess, 
then the exponents differ by 

i /i s 

k z — ' 

1 1 i=i 

In the notation of [13], PE corresponds to /i s (7T, if), and has the largest exponent among the 
{i 1 (tt,i]), < i < s. Unfortunately, [13] is marred by getting this backward, saying that for 
uniform morphisms the largest exponent occurs either for i = or for i = 1! 

In fact, for the morphisms given for n = 5,6, 7, rf is empty, so the aforementioned reversal 
has no effect. However, for 8 < n < 11, rf is non-empty, and a more complicated check than 
indicated in [TJ] is necessary to ensure that the given constructions work. Happily, they do 
indeed work, as a more careful check shows. 

Finally, we mention a few points regarding the search strategy for finding morphisms. The 
second step of the strategy indicated in [13] calls for enumerating all candidate morphisms of 
short enough length. A priori, this involves enumerating all binary words of length at most 
r which are Pansiot encodings of (^-j-) + -free words over S n . Initially this was part of our 
strategy. Unfortunately, our experience supports the conjecture in [15], that the number of 
these words grows approximately as 1.24 r (independently of n.) 

For successive r values we looked at all possible pairs (h(0), h(l)) such that |/i(0)|, \h(l)\ < 
r where h(0),h(l) were Pansiot encodings of (^-j-) + -free words and satisfied the algebraic 
condition; this allowed us to verify the claim of [13] that the morphisms presented therein for 
5 < n < 11 are shortest possible 'convenient morphisms'; the uniforms are all uniform, with 
lengths around 4n — 4 in each case. However, storing all legal Pansiot encodings up to length 
An — 4 fills up a laptop with 2G RAM at around n = 15. Therefore, our search program had 
to migrate to computers with more and more RAM, simply to store Pansiot encodings. On 
the plus side, we found a great number of 'convenient morphisms' for 12 < n < 17, not just 
the ones presented in this paper. 

To find morphisms for n up to 26 (and indeed for various other higher values of n) we 
adopted a different strategy. Using backtracking, we found legal Pansiot encodings of length 
exactly r = An — 4 (or r = An, in the case n = 21), but only saved encodings v for which 
the permutation a(v) was an r-cycle (and thus a candidate for ft-(l)) or an (r — l)-cycle (and 
thus a candidate for h(0)). As soon as a candidate for h{i) was found, it was tested together 
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with each previously found candidate for h(l — i) to see whether a 'convenient morphism' 
could be formed, in which case the search terminated. This search used very little memory, 
and terminated quickly. For n = 26, our C ++ code found the morphism in just over 6 hours. 
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